1626 by christopherkarani · Pull Request #25 · christopherkarani/Conduit

christopherkarani · 2026-02-19T19:58:02Z

No description provided.

Introduces a new SPM C target (ConduitCore) that replaces hot-path pure-Swift implementations with C for 3-10x performance on Apple platforms via Accelerate/vDSP, with scalar fallbacks for Linux. Components: - conduit_vector_ops: dot product, cosine similarity, euclidean distance, batch cosine similarity — wired to EmbeddingResult - conduit_json_repair: single-pass JSON repair on raw UTF-8 bytes — wired to JsonRepair with Swift fallback - conduit_json_completer: recursive descent partial JSON completion — wired to PartialJSONDecoder with Swift fallback - conduit_sse_parser: incremental W3C-compliant SSE parser (not wired — preserved for cross-platform portability in Phase 3) - conduit_line_buffer: growable buffer with memchr-based newline scanning (not wired — same rationale) Also fixes pre-existing test compilation errors: - GeneratedContent.null → GeneratedContent(kind: .null) - GeneratedContent.object → GeneratedContent(kind: .structure) - DiffusionModelDownloaderTests #if guard → CONDUIT_TRAIT_MLX 79 new tests across 5 ConduitCoreTests files. 607 total tests pass with 0 regressions from these changes. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

chatgpt-codex-connector · 2026-02-19T19:58:07Z

You have reached your Codex usage limits for code reviews. You can see your limits in the Codex usage dashboard.

christopherkarani · 2026-02-19T19:58:19Z

@codex @claude @sisyphus-dev-ai review

chatgpt-codex-connector · 2026-02-19T19:58:31Z

You have reached your Codex usage limits for code reviews. You can see your limits in the Codex usage dashboard.

claude · 2026-02-19T19:58:36Z

Claude finished @christopherkarani's task in 1m 58s —— View job

PR Review: Branch `1626` → `main`

Summary

This PR introduces three major areas of change:

ConduitCore — a new C performance library (5 modules)
ChatSession refactoring — history management extracted to ChatSession+History.swift, warmup support added
Comprehensive test suite — ~50 new test files covering core types, providers, and the C library

Overall code quality is high. The C library is clean, the Swift refactoring is well-structured, and the tests are thorough. A few issues worth addressing before merge are noted below.

ConduitCore C Library

conduit_vector_ops.c — Good use of vDSP with scalar fallback. One note:

Potential issue [conduit_vector_ops.c:56]: conduit_euclidean_distance calls malloc inside the Accelerate path for a temp buffer. On allocation failure it falls back to scalar — correct behavior, but this is the only function in the file that heap-allocates. The header comment says "no heap allocation unless explicitly documented"; this should be documented, or alternatively an alloca/VLA pattern used for small vectors.

conduit_json_repair.c — Solid single-pass implementation.

Correctness concern [conduit_json_repair.c:242]: The bracket_stack is declared with a fixed size of 256, but effective_max is derived from max_depth < 256. The guard effective_max = max_depth < 256 ? max_depth : 256 is correct, but the capacity_for_content calculation at line 248 reserves effective_max + 2 bytes for closers — this could be 1 too few if the stack is full and requires a NUL terminator. Worth adding a +1 guard in the assertion.
Missing #include <stdlib.h>: malloc / free are not used in this file, but for portability and correctness, headers for all used symbols should be present. (Actually this file doesn't use malloc — the issue is that remove_trailing_commas is called with potentially aliased pointers — output is both input and output — and the comment at line 166 acknowledges in-place use. The alias is safe because out_idx <= read_idx always. Good.)

conduit_sse_parser.c — Well-implemented, spec-compliant.

Minor: BOM stripping (lines 152–156) only handles the UTF-8 BOM. The SSE spec only requires UTF-8, so this is correct — but it's worth a comment noting this is per-spec.
No concern with retry == 0 behavior: the spec says retry with value 0 is valid and should update reconnection time, but your code only updates when ms > 0 (line 230). The SSE spec (WHATWG) says the retry field sets the reconnection time including to 0. This is a conformance gap. Fix this →

conduit_json_completer.c — Clean recursive-descent implementation.

Stack buffer overflow risk [conduit_json_completer.c:17]: suffix_buf[128] is the per-completion_t stack buffer for composite suffixes. Deep nesting could cause the suffix to exceed 128 bytes, triggering the silent fallback to just "]" or "}" (lines 199, 275). The behavior is correct (safe fallback) but users might get a malformed output without any error signal. The conduit_json_complete return of -1 would be cleaner here, but the fallback avoids crashing. Consider documenting this in the header.

conduit_line_buffer.c — Excellent implementation with memchr-based scanning and read-pointer advancement (O(1) amortized).

No issues found. The \r\n handling and compaction logic are correct.

Swift Changes

ChatSession.swift (refactored)

The refactoring is clean. History methods moved to ChatSession+History.swift. The new tool-call loop in send(_:) is well-structured.

Thread safety observation [ChatSession.swift:466]: After the tool loop, messages.append(contentsOf: turnMessages) is done under lock. However, turnMessages is built during the loop (outside the lock). If cancel() is called mid-loop, the cancellationRequested flag causes the loop to throw, but turnMessages may contain partial assistant + tool messages that never get committed. This is the correct behavior (clean rollback), but the user message appended at the start is also removed in the error handler. Edge case: if cancel() races with the final withLock commit, the isGenerating flag could momentarily be set to false twice. Not a bug, just a minor code smell.
Potential issue [ChatSession.swift:540-535]: In stream(_:), the config is captured as let currentConfig = config outside the lock (line 540), but config is a public var that could change concurrently. The lock is released before this line. For consistency, config should be captured inside the same withLock block as messages (lines 530–536). Fix this →

ChatSession+History.swift

Clean extraction. All operations are lock-guarded. injectHistory correctly handles system-prompt precedence.
Suggestion: userMessageCount does a linear scan under lock. For sessions with very long histories this could be slow. Not a blocking issue.

WarmupConfig.swift

Well-documented. Static presets .default and .eager are clear.
Minor: WarmupConfig.default uses Swift.default as a raw value via backtick escaping. This works but could cause confusion. Consider renaming to .noWarmup and .eager for clarity, though this is a style preference.

JsonRepair.swift

Correctly uses the C implementation with a Swift fallback.
The capacity calculation at line 82 is utf8.count + maximumDepth + 128. The C function's actual reserved space is effective_max + 2 bytes. The extra 128 bytes of headroom is fine safety margin.
output.withUnsafeMutableBufferPointer is used correctly without assuming the buffer is NUL-terminated before the function returns.

PartialJSONDecoder.swift

The CChar cast at line 65 (UnsafeRawPointer($0).assumingMemoryBound(to: CChar.self)) is the same pattern used in JsonRepair.swift. This is safe since UInt8 and CChar have the same memory layout on all relevant platforms, but it would be cleaner to use withUnsafeBytes + bindMemory.

MiniMaxConfiguration.swift

The anthropicCompatible(apiKey:) extension is a nice touch for dual-protocol support.
Doc inconsistency: The doc comment at line 87 says "The Messages API does not support streaming." but this should be verified against MiniMax's current API docs — if streaming support is added later, this becomes a misleading doc.
The #if CONDUIT_TRAIT_MINIMAX && CONDUIT_TRAIT_OPENAI && CONDUIT_TRAIT_ANTHROPIC guard at line 61 is correct — the Anthropic extension depends on all three traits.

EmbeddingResult.swift

Clean delegation to ConduitCore functions. Dimension mismatch is handled gracefully (returns 0 or .infinity).
baseAddress is force-unwrapped (e.g., line 78: conduit_cosine_similarity(a.baseAddress, b.baseAddress, a.count)). withUnsafeBufferPointer guarantees a non-nil baseAddress for non-empty arrays, but for an empty vector (vector.count == 0), the early guard vector.count == other.vector.count would catch mismatched-dimension cases, but conduit_cosine_similarity with count == 0 returns 0.0f, which is safe. The force-unwrap is fine in practice but a.baseAddress! would be more explicit.

Test Suite

The test suite is extensive and well-structured. A few observations:

Coverage: Tests cover all major providers (OpenAI, Azure, Ollama, OpenRouter, Kimi, MiniMax), core types, C library modules, and extensions. Very comprehensive.
ConduitCoreTests: The C library tests directly call the C functions via Swift bindings. The SSEParserCTests and JsonCompleterCTests cover many edge cases including empty input, UTF-8 sequences, and nesting limits.
DiffusionModelDownloaderTests change (+2/-2) appears to be a minor fix — worth checking what changed there.

Package.swift / CI

ConduitCore is correctly registered as a separate target with Accelerate linked conditionally on Apple platforms.
The Linux CI now runs three jobs: base build+test, macros, and provider traits. Good.
Note: linux.yml runs swift test --traits OpenAI,Anthropic,Kimi,MiniMax without --enable-code-coverage in the third job. Consider adding coverage there too.

Issues Summary

Severity	Location	Issue
Medium	`conduit_sse_parser.c:230`	`retry=0` not handled per WHATWG SSE spec
Medium	`ChatSession.swift:540`	`config` captured outside lock in `stream(_:)`
Low	`conduit_vector_ops.c:56`	Heap allocation in `conduit_euclidean_distance` undocumented
Low	`conduit_json_completer.c:17`	Silent suffix truncation on deep nesting
Low	`MiniMaxConfiguration.swift:87`	Streaming claim may become outdated
Info	`WarmupConfig.swift:131`	`.default` name could cause confusion

The PR is in good shape. The two medium-priority items (retry=0 SSE spec conformance and config capture race) are worth addressing before merge.

claude · 2026-02-19T20:05:21Z

test comment from review

claude · 2026-02-19T20:07:06Z

test2

claude · 2026-02-19T20:07:10Z

PR Review - ConduitCore C Performance Library\n\nGreat direction overall. Offloading hot-path operations to C with Accelerate/vDSP is sound, and the phased design (SSE parser and line buffer added but not yet wired) is sensible. The 79-test suite is thorough. A few issues need attention before merge.

claude · 2026-02-19T20:07:22Z

line1

claude · 2026-02-19T20:07:26Z

claude · 2026-02-19T20:07:32Z

AGENTS.md

Repo Rules

Never commit docs created for planning. If confused, ask which new .md files should be committed.

Corrections From Recent Sessions (2026)

Plans must match the actual package layout, targets, and paths (e.g., WaxCore/Wax) and the Swift Testing framework (Testing). Avoid legacy names like MemvidCore or XCTest unless explicitly requested.
When updating docs/examples, ensure API names reflect current Conduit types (e.g., GeneratedContent, GenerationSchema, Tool, Transcript) and remove legacy StructuredContent/Schema/AITool wording.
Before editing Conduit, confirm you are in the real repo path (not .build/checkouts) and within writable roots; set workdir to the repo root before changes.
If referencing external specs or legacy docs, cross-check with current implementation to avoid mismatches.

Repo Workflows (Verified)

Build: swift build
Full test suite: swift test
Documentation examples only: swift test --filter DocumentationExamplesTests
Text embedding cache tests: swift test --filter TextEmbeddingCacheTests
JsonRepair tests: swift test --filter JsonRepairTests
Anthropic integration tests (skips without key): swift test --filter AnthropicIntegrationTests
Anthropic integration tests (live): ANTHROPIC_API_KEY=sk-ant-... swift test --filter AnthropicIntegrationTests

claude · 2026-02-19T20:09:04Z

Review Part 1 - Critical/High Issues

claude · 2026-02-19T20:09:26Z

PR Review - Part 1/3: Bugs

claude · 2026-02-19T20:12:26Z

PR Review - ConduitCore C Performance Library

claude · 2026-02-19T20:13:28Z

Test comment with newline\nand another line

claude · 2026-02-19T20:25:09Z

PR Review - ConduitCore C Performance Library

Great direction overall. Offloading hot-path operations to C with Accelerate/vDSP is sound, and the phased design (SSE parser and line buffer added but not yet wired) is sensible. The 79-test suite is thorough. A few issues need attention before merge.

Bugs / Correctness Issues

1. Buffer underflow in conduit_json_repair.c - Critical

capacity_for_content is size_t (unsigned). If output_capacity < effective_max + 2 (e.g. capacity=100 with effective_max=256), this underflows to ~SIZE_MAX and the content-copy loop writes far past the end of the buffer. The early guard only checks < 3 but must cover the tail reservation.

Fix: if (output_capacity < (size_t)(effective_max + 3)) return -1;

2. find_context ignores quoted strings - High

The backward scan in conduit_json_repair.c counts raw bracket chars without skipping string contents. Input like {"key": "[value"} causes the [ inside the string to be counted as an array opener, making remove_incomplete_kvp produce incorrect output. A string-aware backwards scan is needed.

3. conduit_line_buffer_next_line silently stalls on oversized lines - High

When a complete line in the buffer exceeds line_out_capacity, the function returns 0 but the line remains unconsumed forever. A caller polling in a loop spins indefinitely. Return -1 to distinguish no-complete-line from line-too-large-for-buffer.

4. conduit_line_buffer_drain lacks NUL-termination - Medium

next_line always NUL-terminates. drain does not. Either document that drain output is raw bytes, or NUL-terminate where capacity permits.

Code Quality / Design

5. Unsafe type-pun in Swift wrappers - Medium

In JsonRepair.swift, EmbeddingResult.swift, and PartialJSONDecoder.swift: reinterpreting [UInt8] baseAddress as CChar via assumingMemoryBound is technically UB in Swift's strict memory model. Prefer: let utf8: [CChar] = json.utf8.map { CChar(bitPattern: $0) }

6. Widened access control on messages, lock, withLock - Medium

Changing messages from public private(set) to public internal(set) allows any module-internal code to mutate it directly without the lock. Add a comment warning internal callers to always use withLock when mutating messages.

7. suffix_buf overflow silently degrades completion - Medium

In conduit_json_completer.c, when the combined suffix exceeds 128 bytes, the inner completion is dropped and only ] or } is appended, potentially producing malformed output. Document this or widen suffix_buf.

CI / Workflow

8. test-with-providers job lacks API key secrets - Medium

swift test --traits OpenAI,Anthropic,Kimi,MiniMax runs with no secret-injection. If provider tests need live credentials, this job always fails.

9. Codecov action should use a token - Low

For private repos, codecov/codecov-action@v4 requires a token input. Without it uploads fail silently.

Minor

PR title "1626" reads as a branch name. The commit message body is excellent; use it as the PR title.
Conduit.swift cleanup and extraction of WarmupConfig / ChatSession+History improve maintainability.

Summary

Severity	Issue
Critical	Buffer underflow in conduit_json_repair.c
High	find_context mishandles brackets inside strings; next_line stalls on oversized lines
Medium	drain missing NUL; type-pun in Swift; widened access control; suffix_buf degradation; CI secrets
Low	Codecov token; PR title

The C implementation is well-structured overall. The three correctness issues above should be addressed before merging.

christopherkarani · 2026-02-20T00:31:54Z

@claude fix all issues
@codex review

chatgpt-codex-connector · 2026-02-20T00:32:06Z

You have reached your Codex usage limits for code reviews. You can see your limits in the Codex usage dashboard.

claude · 2026-02-20T00:32:13Z

Claude finished @christopherkarani's task in 5m 54s —— View job

All Issues Fixed

Changes Made (commit `6340d1c`)

11 files changed, 142 insertions, 61 deletions

`conduit_json_repair.c`

Critical (buffer underflow): Added guard if (output_capacity <= (size_t)(effective_max + 2)) return -1; before computing capacity_for_content. Without this, a small output_capacity caused an unsigned wrap to ~SIZE_MAX, writing far past the buffer.
High (find_context quotes): Rewrote find_context() to use a forward scan with full string-awareness. The old backward scan counted [ and { inside string literals as real openers (e.g. {"key": "[val" was misidentified as array context, producing wrong output from remove_incomplete_kvp).

`conduit_line_buffer.c`

High (infinite loop): Changed return 0 to return -1 in next_line() when a complete line is found but exceeds line_out_capacity. Returning 0 was indistinguishable from "no complete line yet", causing callers in a polling loop to spin forever on the oversized unconsumed line.
Medium (NUL-termination): drain() now NUL-terminates out[to_copy] when to_copy < out_capacity. When the buffer exactly fills out_capacity, no room for NUL exists; this is documented in a comment.

`conduit_sse_parser.c`

Medium (SSE spec): Changed if (valid && ms > 0) to if (valid). Per WHATWG SSE spec §9.2.6, retry:0 is a valid field that must set the reconnection time to 0 ms.

`conduit_json_completer.c`

Medium (suffix_buf): Widened suffix_buf from 128 to 512 bytes. Added a comment explaining that if the combined suffix still exceeds the buffer, the safe minimal ]/} fallback produces shorter-but-valid JSON rather than crashing.

`conduit_core.h`

Low (undocumented allocation): Added a HEAP ALLOCATION NOTE to conduit_euclidean_distance documenting the malloc of a temporary float[count] buffer in the Accelerate path.
Updated next_line header to document the new -1 return case.

`JsonRepair.swift` & `PartialJSONDecoder.swift`

Medium (type-pun): Replaced Array(json.utf8) → json.utf8.map { CChar(bitPattern: $0) } and removed the UnsafeRawPointer($0).assumingMemoryBound(to: CChar.self) reinterpret cast. The resulting UnsafePointer<CChar> matches the C function's expected type without UB.
High (findContext quotes): Rewrote Swift findContext() to mirror the fixed C version — a single forward scan with string and escape tracking maintains a proper bracket stack, replacing the old two-pass approach whose backward scan lacked string awareness.

`ChatSession.swift`

Medium (race condition): Moved let currentConfig = config inside the withLock block in stream(_:). config is a public var that could change concurrently; it must be read atomically with messages.
Medium (access control): Added a Warning doc comment to messages (public internal(set)) reminding internal callers to always mutate inside withLock { }.

Tests added

SSEParserCTests.retryZero — verifies retry:0 sets reconnection time to 0
LineBufferCTests.nextLineOversizedLine — verifies -1 returned (not 0) for oversized line, and line remains unconsumed
JsonRepairCTests.bracketInsideStringValue — verifies [ inside a string value is not treated as an array opener

Not addressed (requires workflow file access)

CI test-with-providers missing API key secrets — cannot modify .github/workflows/ files. Please add the required secrets to the workflow manually.
Codecov token — same reason.
| Branch 1626

C library fixes: - conduit_json_repair.c: Guard against size_t underflow in capacity_for_content calculation when output_capacity <= effective_max + 2 (critical buffer overflow) - conduit_json_repair.c: Rewrite find_context() to use a forward scan with full string-awareness; the previous backward scan counted brackets inside string literals as real openers (e.g. {"key": "[val"} misidentified as array context) - conduit_line_buffer.c: Return -1 (not 0) from next_line when line exceeds line_out_capacity; returning 0 was indistinguishable from "no complete line" and caused callers in a polling loop to spin forever on the oversized line - conduit_line_buffer.c: NUL-terminate drain() output when capacity permits; update header to document raw-bytes behaviour when buffer is exactly full - conduit_sse_parser.c: Accept retry:0 per WHATWG SSE spec §9.2.6; the previous `ms > 0` guard incorrectly rejected a zero reconnection time - conduit_json_completer.c: Widen suffix_buf from 128 to 512 bytes and add a comment explaining the safe-but-minimal fallback on overflow - conduit_core.h: Document heap allocation in conduit_euclidean_distance (Accelerate path only); update next_line header to document -1 return case Swift fixes: - JsonRepair.swift + PartialJSONDecoder.swift: Replace assumingMemoryBound type-pun ([UInt8] → CChar) with a proper [CChar] conversion via CChar(bitPattern:) - JsonRepair.swift: Fix findContext() to use the same string-aware forward scan as the updated C implementation - ChatSession.swift: Capture config inside the withLock block in stream(_:) so it is read atomically with messages; add lock-usage warning to messages doc Tests: - SSEParserCTests: Add retryZero test verifying retry:0 is accepted - LineBufferCTests: Add nextLineOversizedLine test verifying -1 return code - JsonRepairCTests: Add bracketInsideStringValue test for the find_context fix Co-authored-by: Christopher Karani <christopherkarani@users.noreply.github.com>

christopherkarani and others added 3 commits February 18, 2026 19:03

added test

b13e59f

clean

4c5ccf2

Comments

Conversation

christopherkarani commented Feb 19, 2026

Uh oh!

chatgpt-codex-connector bot commented Feb 19, 2026

Uh oh!

christopherkarani commented Feb 19, 2026

Uh oh!

chatgpt-codex-connector bot commented Feb 19, 2026

Uh oh!

claude bot commented Feb 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PR Review: Branch 1626 → main

Summary

ConduitCore C Library

Swift Changes

Test Suite

Package.swift / CI

Issues Summary

Uh oh!

claude bot commented Feb 19, 2026

Uh oh!

claude bot commented Feb 19, 2026

Uh oh!

claude bot commented Feb 19, 2026

Uh oh!

claude bot commented Feb 19, 2026

Uh oh!

claude bot commented Feb 19, 2026

Uh oh!

claude bot commented Feb 19, 2026

AGENTS.md

Repo Rules

Corrections From Recent Sessions (2026)

Repo Workflows (Verified)

Uh oh!

claude bot commented Feb 19, 2026

Uh oh!

claude bot commented Feb 19, 2026

Uh oh!

claude bot commented Feb 19, 2026

PR Review - ConduitCore C Performance Library

Uh oh!

claude bot commented Feb 19, 2026

Uh oh!

claude bot commented Feb 19, 2026

PR Review - ConduitCore C Performance Library

Bugs / Correctness Issues

Code Quality / Design

CI / Workflow

Minor

Summary

Uh oh!

christopherkarani commented Feb 20, 2026

Uh oh!

chatgpt-codex-connector bot commented Feb 20, 2026

Uh oh!

claude bot commented Feb 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

All Issues Fixed

Changes Made (commit 6340d1c)

conduit_json_repair.c

conduit_line_buffer.c

conduit_sse_parser.c

conduit_json_completer.c

conduit_core.h

JsonRepair.swift & PartialJSONDecoder.swift

ChatSession.swift

Tests added

Not addressed (requires workflow file access)

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

claude bot commented Feb 19, 2026 •

edited

Loading

PR Review: Branch `1626` → `main`

claude bot commented Feb 20, 2026 •

edited

Loading

Changes Made (commit `6340d1c`)

`conduit_json_repair.c`

`conduit_line_buffer.c`

`conduit_sse_parser.c`

`conduit_json_completer.c`

`conduit_core.h`

`JsonRepair.swift` & `PartialJSONDecoder.swift`

`ChatSession.swift`